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SPECIFICATION 

TITLE OF THE INVENTION 

Transgenic rice plant and its family with environmental 
stress resistant by proline accumulation of high level 
and its production 

BACKGROUND OF THE INVENTION 

The present invention relates to a rice plant 
having a high level of proline accumulating ability, 
and improved salinity- tolerance , drought- tolerance , and 
low temperature-tolerance, and its production method. 

It is known that, for several plants including 
halophytes , when the plants are subjected to a high 
salinity stress or a drought stress, they accumulate 
proline, which is one of amino acids, in their 
cytoplasms. This is considered useful for regulating 
the osmotic pressure in the plant cytoplasm, or 
inhibiting the degradation of a functional protein due 
to the stress. The proline in a plant is synthesized 
from a glutamic acid by two enzymes of a A^-pyrroline- 
5-carboxylate (P5C) synthetase (P5CS) and a P5C 
reductase. On the other hand, proline is degraded into 
a glutamic acid by the two enzymes of a proline 
dehydrogenase (ProDH) and a P5C dehydrogenase. 

When each of the aforesaid plants is subjected 
to a water stress (the state in which water is 
difficult to absorb) such as a high salinity stress or 
a drought stress, the expression level of the P5CS gene 



is increased to activate the P5CS. However, the P5CR 
activity and the gene expression are constant at a low 
level. Further, the gene expression and the enzyme 
activity related to metabolism are also in the 
inhibited states. However, once the water stress has 
been removed, conversely, this time, the gene 
expression and enzyme activity related to biosynthesis 
are inhibited, so that the expression of the ProDH gene 
is rapidly induced, and the enzyme activity is also 
enhanced. As a result, the proline accumulated in the 
cytoplasm is rapidly metabolized to a glutamic acid. 

From the foregoing description, it is 
considered that the P5CS becomes rate-limiting for 
proline synthesis under a water stress. Whereas, the 
ProDH becomes rate-limiting for proline metabolism 
after releasing the water stress (Yoshida et al.. Plant 
Cell Physiol, 38: 1095 - 1102 (1997)). 

SUMMARY OF THE INVENTION 

It is predicted that food shortage due to an 
expansion of the saline soil area caused by drought and 
semi-drought with the deterioration of global 
environment, and population growth will become 
increasingly more serious in the future. Researches 
have been pursued in diversified fields respectively on 
the breeding of crop plants resistant to a high 
salinity stress, a drought stress, and a low 
temperature stress (the state in which water is 



difficult to absorb) as those playing an important role 
in solving the world food problem, and the results are 
expected to be promising. 

It is an object of the present invention to 
provide a rice plant which has a high proline 
accumulating ability, and accordingly has improved 
salinity-tolerance, drought-tolerance, and low 
temperature-tolerance by focusing attention on the 
importances of a A^-pyrroline-5-carboxylate (P5C) 
synthetase {P5CS) and a proline dehydrogenase (ProDH) 
which are the rate-limiting enzymes related to 
synthesis and metabolism of proline in plants, and 
regulating the expression of genes for the enzymes with 
a gene recombination technology, and its production 
method . 

The P5CS gene related to proline synthesis is 
introduced to be overexpressed ; the antisense (reverse 
DNA sequence-containing) gene of the ProDH gene related 
to the metabolism is introduced to inhibit the 
degradation of proline; or both the P5CS gene and the 
antisense gene of the ProDH gene are introduced to 
promote the proline synthesis while inhibiting the 
degradation of proline. As a result, proline is 
accumulated with a high concentration in the cells of 
rice and a rice plant. 

In the present invention, by accumulation of 
proline at a high concentration, it becomes possible to 
perform molecular breeding of rice and a rice plant 



having salinity-tolerance, drought-tolerance, or low 
temperature-tolerance . 

Heretofore, there is known no report that an 
increase in concentration of proline as an 
5 osmoprotectant is allowed by synthesis promotion and 
degradation inhibition in rice and a rice plant. The 
inventors of the present invention have focused 
attention on the importances of the P5CS gene and the 
ProDH gene. Then, in order to solve novel technical 
^ 10 problems which have not been known in the prior art, 
^ they have conducted studies from various fields 

Bl including the study on the selection of the rice 

% 

variety into which the gene is easily introduced, the 
study for improving the callus formation rate, the 
IJ 15 study on the construction of a vector for introducing 
r=2 the gene for rice, and the like. In consequence, they 

^ have provided novel technical elucidation, resulting in 

the completion of the present invention. 

In the present invention, there are provided a 
20 rice plant transformed by introducing therein the 

proline synthesis gene and the antisense gene of the 
proline metabolism gene derived from rice or 
Arabidopsis thaliana individually or in combination, 
and its production method. 
25 In the rice plant of the present invention, 

either or both of the gene encoding the synthetase 
protein of proline which is one of amino acids and the 
antisense gene of the proline dehydrogenage have been 



4 



introduced. With this construction, it is possible to 
implement a rice plant having improved salinity- 
tolerance, drought- tolerance , and low temperature- 
tolerance. Further, the mature rice seeds gathered 
5 from the rice plant of the present invention, 

particularly the rice seeds are characterized by 
keeping a high proline accumulating ability over a 
plurality of generations. 
m Further, the present invention is targeted for 



ru 



rice and rice plants. The targets have no particular 
^ restriction as long as they are the plants belonging to 

pi the rice plants. Examples of the plants belonging to 

%J 

the rice plants include rice, corn, wheat, barley, rye, 
turf, millet, and barn grass. In particular, the 
flJ 15 present invention can be more preferably applied to 



rice . 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGS. lA to ID are diagrams respectively 
showing the vectors for rice in which proline 
synthesis-related enzyme P5CS genes and proline 
metabolism-related enzyme ProDH genes, and antisense 
genes thereof have been respectively incorporated; 

FIG. 2 is a graph showing the amount of proline 
accumulated in rice lines under no stress in which the 
vectors shown in FIGS. lA to ID have been respectively 
introduced by genetic engineering; and 

FIG. 3 is a graph showing the salinity- 
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tolerance of each of the transgenic rice lines in which 
the proline-related genes have been respectively 
incorporated shown in FIG. 2. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
In rice plants of examples of the present 
invention, either or both of the proline 
(osmoprotectant) synthesis gene and the antisense gene 
of the proline metabolism derived from rice or 
Arabidopsis thaliana gene have been introduced for 
transformation . 

Examples of one type of gene to be introduced 
to the rice plants of the examples of the present 
invention include: (1) a P5CS ( A-^-pyrroline-5- 
carboxylate (P5C) synthetase) gene of rice containing 
the sequence (DNA sequence and amino acid sequence) 
according to SEQ ID No. 1; (2) a P5CS { A -^-pyrroline-S- 
carboxylate (P5C) synthetase) gene of Arabidopsis 
thaliana containing the sequence (DNA sequence and 
amino acid sequence) according to SEQ ID N2 ; and (3) 
the antisense (reverse DNA sequence-containing) gene of 
the ProDH (proline dehydrogenase) gene of Arabidopsis 
thaliana containing the sequence (DNA sequence and 
amino acid sequence) according to Seq ID NO. 3. 

Examples of the two types of genes to be 
introduced into the rice plants of the examples of the 
present invention include: 

(1) Two genes of the P5CS ( A ^-pyrroline-5-carboxylate 



(P5C) synthetase) of rice containing the sequence 
according to SEQ ID NO. 1 or the P5CS gene of 
Arabidopsis thaliana containing the sequence according 
to SEQ ID NO. 2, and the antisense (reverse DNA 
5 sequence-containing) gene of the ProDH (proline 

dehydrogenase) gene of Arabidopsis thaliana containing 
the sequence according to SEQ ID NO. 3; and 
(2) Tandemly connected two genes of the P5CS (A-^- 
'Z pyrroline-5-carboxylate (P5C) synthetase) gene of rice 

O 10 containing the sequence according to SEQ ID NO. 1 or 

w- 

m the P5CS gene of Arabidopsis thaliana containing the 

sequence according to SEQ ID NO. 2, and the antisense 

"'4 (reverse DNA sequence-containing) gene of the ProDH 

(proline dehydrogenase) gene of Arabidopsis thaliana 

hi 

--y 15 containing the sequence according to SEQ ID NO. 3. 

'^4 In each of the vectors to be used in the 

O 

1^ examples of the present invention, there is 

incorporated any one gene of the P5CS ( A -"--pyrroline-S- 
carboxylate (P5C) synthetase) gene of rice containing 

20 the sequence according to SEQ ID NO. 1, the P5CS gene 
of Arabidopsis thaliana containing the sequence 
according to SEQ ID NO. 2, and the antisense (reverse 
DNA sequence-containing) gene of the ProDH (proline 
dehydrogenase) gene of Arabidopsis thaliana containing 

25 the sequence according to SEQ ID NO. 3. Alternatively, 
there are incorporated two genes of the P5CS gene of 
rice or Arabidopsis thaliana, and the aforesaid 
antisense gene in tandemly connected relation to each 
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other. 

The rice plants of the examples of the present 
invention can be obtained by, for example, any of the 
following methods. 

(1) The aforesaid vector is introduced into the calli 
derived from a rice plant, and the calli are grown. 
Then, a plant body is regenerated from the calli; 

(2) The aforesaid vector is introduced into the 
protoplast derived from a rice plant, and a plant body 
is regenerated from the colony obtained by growing the 
protoplast; and 

(3) Crossing with the rice plants obtained by 
introducing the vector therein by genetic engineering 
is carried out. 

Examples of the production method of the rice 
plants of the examples of the present invention include 
the following methods : 

(1) The aforesaid vector is introduced into the calli 
derived from a rice plant by using Agrobacterium 
tumefaciens, and the calli are grown. Then, a plant 
body is regenerated from the calli; 

(2) The aforesaid vector is introduced into the 
protoplast derived from a rice plant by electroporation , 
and a plant body is regenerated from the colony 
obtained by growing the protoplast; and 

(3) Crossing with the rice plants obtained by 
introducing the vector therein by genetic engineering 
is carried out. 



These production methods provide a rice plant 
having a high proline accumulating ability, and having 
improved salinity-tolerance, drought- tolerance , and low 
temperature-tolerance levels. 

Further, mature seeds gathered from the rice 
plants of the examples of the present invention, 
particularly the rice seeds will maintain their high 
proline accumulating abilities over a plurality of 
generations . 

The rice plants of the examples of the present 
invention and its production method will be described 
in details by way of embodiments thereof by using rice 
as a typical example step by step below. It is 
needless to say that the steps described below are 
applicable to other rice plants than rice with or 
without changing the various conditions. 
(Gene cloning) 
First, a mRNA is extracted from a rice seedling. 
A cDNA is synthesized by using the mRNA. The cDNA is 
combined with a vector made of a plasmid or a phage, 
and introduced into E. coli to prepare a recombinant 
DNA. The resulting trans formant in which the 
recombinant DNA has been introduced is subjected to 
screening by plaque hybridization using the P5CS gene 
from Arabidopsis thaliana as a probe. The sequences of 
the P5CS genes from rice and Arabidopsis thaliana have 
been already reported (Yoshiba et al. , Plant J. (1995) 
7:751-760, and Igarashi et al.. Plant Mol. Biol. (1997) 



33:857-865). Based on these reports, appropriate 
primers are designed, and subjected to screening by PGR 
to select a target transf ormant . A target plasmid is 
isolated from the transf ormant obtained- If required, 
it is cut with an appropriate restriction enzyme, and 
subjected to subcloning in a plasmid vector for cloning. 
It is also possible to subject the P5CS gene of 
Arabidopsis thaliana to cloning in the same manner as 
with rice. However, as a sample from which a mRNA is 
to be extracted, the one subjected to a high salinity 
stress (immersed in a 250 raM NaCl solution or the like) 
or the one subjected to a drought stress treatment is 
more preferable than the one bred under a normal 
environment. This is because the P5CS gene is induced 
in response to a water stress such as a high salinity 
stress or a drought stress (Yoshiba et al. , Plant J. 

(1995) 7: 751-760, Igarashi et al . , Plant Mol . Biol. 

(1997) 33: 857-855, and Yoshiba et al.. Plant Cell 
Physiol. (1997) 38: 1095-1102). 

On the other hand, it is also possible to 
subject the ProDH gene of Arabidopsis thaliana (its 
sequence has already been reported in Kiyosue et al.. 
Plant Cell (1996) 8:1323-1335) to cloning in the 
foregoing manner. However, as the sample from which a 
raRNA is to be extracted, there may be used the one 
which has been subjected to a drought stress (about 10- 
hour treatment) , then immersed in water again, and 
allowed to absorb water, the one which has been 



immersed in a proline solution, and allowed to absorb 
proline, or the like. This is due to the following 
fact. Namely, the ProDH gene is inhibited from its 
expression under a water stress, and the gene 
expression is induced by a high concentration of 
proline (Kiyosue et al. , Plant Cell (1996) 8: 1323-1335, 
and Toshiba et al.. Plant Cell Physiol. (1997) 38: 
1095-1102) . 

If the samples as described above are used, it 
is possible to isolate the P5CS gene and the ProDH gene 
not only from rice or Arabidopsis thaliana but also 
from other rice plants. 

(Construction of gene introduction vector) 
Respective P5CS genes and ProDH genes subjected 
to cloning are cut from plasraids with appropriate 
restriction enzymes, and, as shown in FIGS. lA to ID, 
each is combined behind the 35S promoter of a 
cauliflower mosaic virus of a vector for rice obtained 
by raodifying a pBI vector. In FIGS. lA to ID, RB 
denotes the right border, 35SPro denotes the promoter 
of a cauliflower mosaic virus, P5CS denotes the proline 
synthesis-related enzyme gene of rice or Arabidopsis 
thaliana, ProDH denotes proline metabolism-related 
enzyme gene of Arabidopsis thaliana, Noster denotes the 
terminator of a nopaline synthetase gene, HTP denotes a 
hygromycine resistant gene, and LB denotes the left 
border. Whereas, each of the arrows indicates the 
orientation of the sense of each gene. 



In FIGS. lA to ID, FIG. lA is a diagram showing 
an example of the vector (construct) so constructed 
that the sequence in the order of RB-35SPro-P5CS- 
Noster-SSSPro-HTP-Noster-LB has been achieved. FIG. IB 
is a diagram showing an example in which, with respect 
to FIG. lA, the same sequence in the order of RB- 
35SPro-P5CS-Noster-35SPro-HTP-Noster-LB as in the 
construct of FIG. lA has been achieved, but the gene 
P5CS has been sequenced in antisense orientation. FIG. 
IC is a diagram showing an example in which the gene 
ProDH has been sequenced in antisense orientation, and 
substituted for the gene P5CS of- the construct of FIG. 
lA, to construct a vector with a sequence in the order 
of RB-35SPro-ProDH (antisense) -Noster-35SPro-HTP- 
Noster-LB. FIG. ID is a diagram showing an example in 
which, to the construct of FIG. lA, the gene ProDH has 
been further sequenced in antisense orientation, and 
the construct shown in FIG. IC has been further 
connected thereto in tandem, to construct a vector with 
a sequence in the order of RB-35SPro-P5CS-Noster- 
35SPro-ProDH (antisense) -Noster-35SPro-HTP-Noster-LB . 

The 35S promoter is well known as a promoter 
which is strong and invariably induces the gene 
expression in any tissue. As for the orientation in 
which the gene is incorporated, the P5CS gene is 
connected in the sense orientation, and the ProDH gene 
in the antisense orientation. 

Then, each vector to which each of the genes 



has been connected is introduced into Agrobacterium 
tumefaciens EHA 101 by electroporation. The 

Agrobacterium tumefaciens in which each construct (FIGS 
lA to ID) has been introduced is cultured and grown in 
5 a YEP medium containing Bacto Pepton (10 g/1) , Bacto 
Yeast Extract (10 g/1) , sodium chloride (5 g/1) , IM 
magnesium chloride (2 ml/1) , and hygromycine B (50 
mg/1) at 28 °C . Gene introduction is carried out by 
'f=% infecting the callus cell of rice with the 

'■^ 10 Agrobacterium tumefaciens into which each construct 
in (FIGS. lA - ID) has been introduced. The construct D 

H 

pi is so designed that the two genes (the P5CS gene and 

"^'^ the ProDH gene) are connected to each other in tandem 

N" to be simultaneously introduced. However, even if the 

rlj 15 constructs A and C are mixed for coinfection, it is 
pf also possible obtain the same effects as with the 

H" construct D. 

Incidentally, a HPT (hygromycine resistant) 
gene is connected to each construct. This is for 
20 efficiently selecting the cell and plant body 

transformed for the basic research on analysis of the 
effects of the introduced genes. Therefore, the HPT 
gene is not required to be incorporated therein for 
actual cultivation on the salt damaged land or the dry 
25 land. 

(Induction of rice calli for gene introduction) 

Mature rice seeds are sterilized with 70 % 
ethyl alcohol for 10 minutes, and with 3 % sodium 
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hypochlorite for 1 hour after stripping the hulls 
therefrom. After sterilization, the seeds are washed 
with sterilized water 3 times, and bedded on a pH 5.8 
N6 medium (2N6 medium) containing 1 g/1 casamino acid, 
5 30 g/1 sucrose, 2 mg/1 2 , 4-dichlorophenoxyacetic acid, 
and 2 g/1 Gelrite, and cultured at 28 in the dark 
for 3 to 5 weeks. 

{Gene introduction into rice calli) 
Out of the rice calli induced in the foregoing 
10 manner, the ones with a size of 1 to 3 mm are bedded on 
'f, the 2N6 medium again, and cultured at 28 13 in the dark 

^" for 3 to 4 days. As a result, it is possible to 

enhance the division activity of the callus cell. The 
L. gene introduction is carried out by mixing the cultured 

fy 15 calli and a solution of each construct-introduced 
rg Agrobacteriura tumefaciens grown in the YEP medium (the 

solution diluted so that the concentration of the 
bacteria is 0.1 as determined at OD 660nm) for 
infection. Thereafter, the calli are cultured at 25 °C 
20 in the dark for 3 days. After cultivation, the calli 

are washed and sterilized several times by a cefotaxime 
aqueous solution with a concentration of 1 mg/4 ml to 
remove extra bacteria attached to the surfaces of the 
calli, and cleaned with a sterilized kim towel or the 
25 like. Subsequently, it is bedded on a 2N6 medium 
(secondary selection medium) containing 250 mg/1 
cefotaxime and 10 mg/1 hygromycine B, and cultured at 
2 8 °C in the dark for 1 week. 
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(Selection of transformed calli and 
regeneration of plant body) 

The calli cultured in the medium containing 
cefotaxime is bedded on a medium (secondary selection 
medium) in which the content of hygromycine B has been 
increased to 30 mg/1, and cultured at 28 °C in the dark 
for 3 weeks. Thereafter, the calli are transferred to 
a pH 5.8 MS medium (regeneration induction medium) 
containing 30 g/1 sucrose, 30 g/1 sorbitol, 2 g/1 
casamino acid, 11 g/1 MES buffer, 2 mg/1 NAA, 1 mg/1 
kinetin, 250 mg/1 cefotaxime, 30 mg/1 hygromycine B, 
and 4 g/1 Gelrite, and cultured in the bright place at 
28 °C for 3 week. The gene-introduced calli form a 
green spot, from which shoots and roots are regenerated. 
The regenerated calli are further transferred to a pH 
5.8 MS medium (plant body formation medium) containing 
30 g/1 sucrose, 250 mg/1 cefotaxime, 30 mg/1 
hygromycine B, and 8 g/1 agar, from which plant 
hormones have been removed, and cultured in the bright 
place at 28 °C for several weeks. In consequence, the 
plant body is bred more largely. 

(Breeding of transformed rice plant body and 
seed formation) 

Upon having grown to a seedling height of about 
4 to 5 cm in a petri dish, the regenerated rice is 
transferred to a planter in which the soil for raising- 
seedling is placed. Then, it is bred in an artificial 
climate system with an illuminance of about 20,000 Ix 



under a temperature condition of 28 °C until the fourth 
leaf to the fifth leaf develop. Subsequently, the 
seedling is further transferred into a pot containing 
the soil into which a fertilizer has been appropriately 
added, and bred in a greenhouse until the seeds ripen. 
Assuming that the present generation of the plant body 
regenerated is of the TO generation, and that the seeds 
obtainable from this plant body is of the Tl generation, 
the ones of the T2 to T3 generations are bred. When 
they are cultivated in an actual farm land, they are 
required to be commercialized after carrying out the 
various safety evaluation tests over further 
generations, and confirming the safety. 

(Extraction of proline from transformed rice 
and concentration measurement thereof) 

Proline is extracted from the leaves of the 
seedling (whose forth leaf has developed) of the 
transformed rice of the T2 generation or the T3 
generation. The leaves of the rice seedling bred in 
the artificial climate system are cut off in an amount 
of about 200 mg by scissors or the like. Then, in a 
mortar, liquid nitrogen is added thereto, and the 
leaves are ground into powder. The resulting sample in 
powder form is mixed with pure water, and further 
milled by means of a homogenizer or the like. The 
milled sample is heated at 97 for 6 minutes, and 
then ice cooled. The sample is then centrifuged at 
about 17 , 000 Xg for 10 minutes at 4 °C to separate the 



supernatant. To the supernatant obtained, a 
trichloroacetic acid is added and mixed so that the 
final concentration is 5 %. The resulting mixture is 
then centrifuged at about 17,000 XG for 10 minutes at 
4 °C again to precipitate protein. Proline as an 
osmoprotectant is contained in the supernatant at this 
step, and the concentration thereof is determined by 
means of high performance liquid chromatography (HPLC) . 
The qualitative determination of proline is carried out 
in the following manner. The solutions in which 
various amino acids have been dissolved to a given 
concentration are previously determined by HPLC. The 
amount of proline contained in the leaf of an actual 
transgenic rice is determined based on the retention 
times . 

FIG. 2 shows the proline content of each of the 
transgenic rice lines under no stress into which 
various genes have been introduced. The hollow graphs 
in the leftmost column represent control samples into 
which proline-related genes have not been incorporated. 
Whereas, the solidly shaded graphs in the right-hand 
five columns denote respective transgenic rice lines 
into which proline-related genes have been incorporated. 
It is indicated that the proline content varies 
according to the type of the gene introduced. 

There is observed almost no accumulation for 
each sample in which the P5CS gene (OsPSCS) of rice has 
been introduced in antisense orientation (FIG. IB) in 



the second column from left. For each sample in which 
the P5CS gene (AtPSCS) of Arabidopsis thaliana has been 
introduced in sense orientation (FIG. lA) in the third 
column from left, there is observed an increase in 
amount of proline accumulated over the control samples. 
Similarly, for each sample in which the ProDH gene 
(AtProDH) of Arabidopsis thaliana has been introduced 
in antisense orientation (FIG. IC) and each sample in 
which the P5CS gene (OsP5CS) of rice has been 
introduced in sense orientation (FIG. lA) in the fourth 
and fifth columns from left, respectively, there are 
observed increases in amount of proline accumulated 
over the control sample.- In contrast to these, for 
each sample in which the P5CS gene (OsPSCS) of rice has 
been introduced in sense orientation, and the ProDH 
gene (AtProDH) of Arabidopsis thaliana in antisense 
orientation in the rightmost column, there is observed 
a considerably larger amount of proline accumulated 
(100 times or more with respect to the control sample 
for the case where the amount of proline accumulated is 
larger) as compared with each of the aforesaid samples 
in which one type of gene has been introduced. Then, 
it is indicated that each sample of OsP5CS (in the 
fifth column from left) is slightly more effective for 
proline accumulation than each sample of AtPSCS (in the 
third column from left) among the samples in which 
genes have been introduced in sense orientation. 

(Salinity tolerance test and improvement of 
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salinity tolerance of transgenic rice) 

FIG. 3 shows the results of a salinity 
tolerance test performed at a 250 rtiM concentration 
(about half the salt concentration of sea water) by 
using several lines of the transgenic rice for which 
proline accumulation has been observed shown in the 
right hand four columns of FIG. 2. The hollow graphs 
denote the control samples in which proline related 
genes have not been incorporated. Whereas, the solidly 
shaded graphs denote the transgenic rice samples. The 
salinity tolerance test was carried out in accordance 
with the testing method using known survival rates as 
indexes (Japanese Published Unexamined Patent 
Application No. Hei 09-266726, title of the invention: 
evaluation of salt resistance of plant) . It has been 
shown that the control samples in which proline-related 
genes have not been introduced die 5 days after a salt 
treatment, while the transgenic rice samples which 
accumulate proline show high survival rates, i.e., 95 % 
for the third day, and 65 % even after the five-day 
treatment. This indicates that the salinity tolerance 
can be improved by transforming rice, and thereby 
enhancing the proline accumulating ability thereof. 

Therefore, if the gramineous crop produced 
according to the present invention is subjected to 
breeding by further pursuing detailed analysis such as 
the safety evaluation thereon, it becomes capable of 
being cultured in the salt accumulated soil or the 



desertified soil. Therefore, food productivity can be 
expected to be improved. Further, it can be largely 
expected that the crop plant is also capable of coping 
with the population growth in developing countries. 

In accordance with the present invention, it 
has become possible to produce a transgenic rice plant 
having an enhance proline accumulating ability. 
Further, for the rice plant produced by the method of 
the present invention, the amount of proline 
accumulated therein has been increased, so that it has 
become possible to improve the salinity tolerance level 
thereof. 
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<212> DNA 

<213> Oryza sativa L. 

<220> 

<221> CDS 

<222> 99. . 2249 

<300> 

<301> Yuraiko Igarashi, Yoshu Yoshiba, Yukika 
Sanada, Kazuko Yamaguchi-Shinozaki, Keishiro Wada, 
Kazuo Shinozaki 

<302> Characterization of the gene for A ^- 
pyrroline-5-carboxylate synthetase and correlation 
between the expression of the gene and salt 
tolerance in Oryza sativa L. 
<303> Plant Molecular biology 



2 1 



<304> 33 

<306> 857-865 

<307> 1996-12-03 

<308> D49714 

<309> 1995-03-16 

<400> 1 

gcggctgcgg cggcaaggcg gcgagacgtg ggagagggat ttacaggtag agggagaggg 60 

tggaggagga gaggctgagg ctaggaagcg gtttcgcc atg gcg age gtc gac ccg 116 
% Met Ala Ser Val Asp Pro 

% 1 ^ 

%| tec egg age ttc gtg agg gae gtg aag cgc gte ate ate aag gtg gge 164 

Ser Arg Ser Phe Val Arg Asp Val Lys Arg Val He He Lys Val Gly 
fy 10 15 20 

ru 

p act gca gtt gtc tec aga caa gat gga aga ttg get ttg gge agg gtt 212 

Thr Ala Val Val Ser Arg Gin Asp Gly Arg Leu Ala Leu Gly Arg Val 
25 30 35 

gga get ctg tge gag cag gtt aag gaa ctg aac tet tta gga tac gaa 260 
Gly Ala Leu Cys Glu Gin Val Lys Glu Leu Asn Ser Leu Gly Tyr Glu 
40 45 50 

gtg att ttg gtc acc tea ggt get gtt gga gtg ggg cga cag ega ett 308 
Val He Leu Val Thr Ser Gly Ala Val Gly Val Gly Arg Gin Arg Leu 
55 60 65 70 
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agg tac egg aag ctt gtc aat age age ttt get gat etg eaa aag eea 
Arg Tyr Arg Lys Leu Val Asn Ser Ser Phe Ala Asp Leu Gin Lys Pro 
75 80 85 



356 



cag atg gag tta gat gga aag get tgt gee get gtt ggt cag agt gga 404 

Gin Met Glu Leu Asp Gly Lys Ala Cys Ala Ala Val Gly Gin Ser Gly 

90 95 100 

p etg atg get ett tae gat atg ttg ttt aae eaa etg gat gte teg tea 452 

Jji Leu Met Ala Leu Tyr Asp Met Leu Phe Asn Gin Leu Asp Val Ser Ser 

P 105 110 115 

m 

y tet eaa ctt ett gtc ace gae agt gat ttt gag aac cca aag ttc egg 500 

i^* Ser Gin Leu Leu Val Thr Asp Ser Asp Phe Glu Asn Pro Lys Phe Arg 

fij 120 125 130 

H gag caa etc act gaa act gtt gag tea tta tta gat ctt aaa gtt ata 548 

Glu Gin Leu Thr Glu Thr Val Glu Ser Leu Leu Asp Leu Lys Val lie 

135 140 145 150 

cca ata ttt aat gaa aat gat gee ate age act aga aag get eea tat 596 

Pro lie Phe Asn Glu Asn Asp Ala He Ser Thr Arg Lys Ala Pro Tyr 

155 160 165 

gag gat tea tet ggt ata ttc tgg gat aat gac agt tta gea gga etg 644 

Glu Asp Ser Ser Gly He Phe Trp Asp Asn Asp Ser Leu Ala Gly Leu 

170 175 180 



23 



ttg gca ctg gaa ctg aaa get gat etc ett att ctg cte agt gat gtg 
Leu Ala Leu Glu Leu Lys Ala Asp Leu Leu He Leu Leu Ser Asp Val 
185 190 195 



692 



gat ggg ttg tat agt ggt cca cca agt gaa cca tea tea aaa ate ata 740 
Asp Gly Leu Tyr Ser Gly Pro Pro Ser Glu Pro Ser Ser Lys He He 
200 205 210 

ir cac act tat att aaa gaa aag cat cag caa gaa ate act ttt gga gae 788 

D His Thr Tyr He Lys Glu Lys His Gin Gin Glu He Thr Phe Gly Asp 

ru 

m 215 220 225 230 

m 

""^ aaa tct egt gta ggt aga gga gge atg aea gca aaa gtg aag get get 836 

Lys Ser Arg Val Gly Arg Gly Gly Met Thr Ala Lys Val Lys Ala Ala 

Jy 235 240 245 

''J 
Q 

hf> gtc ttg get tea aat age ggc aca cot gtg gtt att aca agt ggg ttt 884 

Val Leu Ala Ser Asn Ser Gly Thr Pro Val Val He Thr Ser Gly Phe 
250 255 260 

gaa aat egg age att ett aaa gtt ett eat ggg gaa aaa att ggt act 932 
Glu Asn Arg Ser He Leu Lys Val Leu His Gly Glu Lys He Gly Thr 
265 270 275 

etc ttt cac aag aat gcg aat ttg tgg gaa tea tct aag gat gtt agt 980 
Leu Phe His Lys Asn Ala Asn Leu Trp Glu Ser Ser Lys Asp Val Ser 
280 285 290 



2 4 



act cgt gag atg get gtt gcc gca aga gat tgt tea agg eat eta eag 
Thr Arg Glu Met Ala Val Ala Ala Arg Asp Cys Ser Arg His Leu Gin 
295 300 305 310 



1028 



aat ttg tea tea gag gaa cga aaa aag ata ttg eta gat gtt gea gat 
Asn Leu Ser Ser Glu Glu Arg Lys Lys He Leu Leu Asp Val Ala Asp 
315 320 325 



1076 



nj 
m 

H 



fU 

ru 

H 



get ttg gag gca aat gag gat tta ata agg tct gag aat gaa get gat 1124 
Ala Leu Glu Ala Asn Glu Asp Leu He Arg Ser Glu Asn Glu Ala Asp 
330 335 340 

gta get geg gcc caa gtt get gga tat gag aag cct ttg gtt get aga 1172 
Val Ala Ala Ala Gin Val Ala Gly Tyr Glu Lys Pro Leu Val Ala Arg 
345 350 355 

ttg act ata aaa cca gga aag ata gca age ctt gca aaa tct att cgt 1220 
Leu Thr He Lys Pro Gly Lys He Ala Ser Leu Ala Lys Ser He Arg 
360 365 370 



acc ctt gca aat atg gaa gae ect ata aae cag ata ctt aaa aag aca 
Thr Leu Ala Asn Met Glu Asp Pro He Asn Gin He Leu Lys Lys Thr 
375 380 385 390 



1268 



gag gtt get gat gat tta gtt ctt gag aaa aca tct tgc cca tta ggt 
Glu Val Ala Asp Asp Leu Val Leu Glu Lys Thr Ser Cys Pro Leu Gly 
395 400 405 



1316 



2 5 



gtt etc tta att gtt ttt gag tec cga cct gat gee ttg gtt eag att 1364 
Val Leu Leu lie Val Phe Glu Ser Arg Pro Asp Ala Leu Val Gin lie 
410 415 420 

gca tot ttg gea att cga agt ggt aat ggt ctt etc eta aaa ggt gga 1412 
Ala Ser Leu Ala He Arg Ser Gly Asn Gly Leu Leu Leu Lys Gly Gly 
425 430 435 

aaa gaa get ate aga tea aac acg ata ttg eat aag gtt ata act gat 1460 
Lys Glu Ala He Arg Ser Asn Thr He Leu His Lys Val He Thr Asp 
440 445 450 

get att cct cgt aat gtt ggt gaa aaa ctt att ggc ett gtt aca act 1508 
^ Ala He Pro Arg Asn Val Gly Glu Lys Leu He Gly Leu Val Thr Thr 

ni 455 460 465 470 

It 



1556 



aga gat gag ate gca gat ttg eta aag ctt gat gat gtc att gat ctt 
Arg Asp Glu He Ala Asp Leu Leu Lys Leu Asp Asp Val He Asp Leu 
475 480 485 



gtc act eca aga gga agt aat aag ctt gtc tct caa ate aag gcg tea 1604 
Val Thr Pro Arg Gly Ser Asn Lys Leu Val Ser Gin He Lys Ala Ser 
490 495 500 

act aag att cct gtt ett ggg eat get gat ggt ata tgc cae gta tat 1652 
Thr Lys He Pro Val Leu Gly His Ala Asp Gly He Cys His Val Tyr 
505 510 515 



2 6 



att gac aaa tea get gac atg gat atg gca aaa ctt att gta atg gat 
He Asp Lys Ser Ala Asp Met Asp Met Ala Lys Leu He Val Met Asp 
520 525 530 



1700 



gca aaa act gat tac cca gca gcc tgc aat gca atg gag acc tta eta 1748 
Ala Lys Thr Asp Tyr Pro Ala Ala Cys Asn Ala Met Glu Thr Leu Leu 
535 540 545 550 



gtt cat aag gat ett atg aag agt cca ggc ctt gac gac ata tta gta 
Val His Lys Asp Leu Met Lys Ser Pro Gly Leu Asp Asp He Leu Val 
555 560 565 



1796 



ry 

RJ 

H 
Q 



gca eta aaa aca gaa gga gtt aat att tat ggt gga cct att gcg cac 
Ala Leu Lys Thr Glu Gly Val Asn He Tyr Gly Gly Pro He Ala His 
570 575 580 

aaa get ctg gga ttt cca aaa get gtt tea ttt eat eat gag tat agt 
Lys Ala Leu Gly Phe Pro Lys Ala Val Ser Phe His His Glu Tyr Ser 
585 590 595 



1892 



tct atg gcc tgc act gtt gag ttt gtt gat gat gtt caa tea gca att 1940 
Ser Met Ala Cys Thr Val Glu Phe Val Asp Asp Val Gin Ser Ala He 
600 605 610 



gac cat att cat cgt tat gga agt get cat aca gat tgt ate gtc act 1988 
Asp His He His Arg Tyr Gly Ser Ala His Thr Asp Cys He Val Thr 
615 620 625 630 



2 7 



aca gat gat aag gta gca gag act ttt eta cgc aga gtt gat agt get 2036 
Thr Asp Asp Lys Val Ala Glu Thr Phe Leu Arg Arg Val Asp Ser Ala 
635 640 645 

get gta ttt eat aat gca agt aeg aga ttc tct gat ggg get cgt ttt 2084 
Ala Val Phe His Asn Ala Ser Thr Arg Phe Ser Asp Gly Ala Arg Phe 
650 655 660 

~- gga ttg ggt get gag gtt gge ata age aea ggg cgt ate eat gee cgt 2132 

p Gly Leu Gly Ala Glu Val Gly He Ser Thr Gly Arg He His Ala Arg 

I 665 670 675 

m 

gga cca gtg ggt gtt gaa ggt etc tta act aca cga tgg ate ttg cga 2180 

; Gly Pro Val Gly Val Glu Gly Leu Leu Thr Thr Arg Trp He Leu Arg 

680 685 690 

gga cgt ggg caa gtg gtg aat ggt gac aag gat gtc gtg tac ace cat 2228 
Gly Arg Gly Gin Val Val Asn Gly Asp Lys Asp Val Val Tyr Thr His 
695 700 705 710 

aag agt ett ect ttg caa tgaggtcaaa tgetcetttt agectgttca 2276 
Lys Ser Leu Pro Leu Gin 
715 

ggagtaggtg aatatccttt taagaatgga ttgaetactt tattttgtca tcttgtacaa 2336 
gcatcttatt gcggcattce gatggattat tgattttggg ggttcccaet ttcaaatgtg 2396 



2 8 



acaccaaaaa taaattcatc agttctgaga gcaagatttt ggaggttcag cttctccatg 2456 



taataagtaa attcagttct gagaacttgt gtaccaacgc gctatgttgc ttgtaatgag 2516 

cgatactaac atctgtgatt gcacatatac taa 2549 
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<400> 2 

ctgatattta ttttcttacc ttaaatacga cggtgcttca ctgagtccga ctcagttaac 60 



29 



tcgttcctct ctctgtgtgt ggttttggta gacgacgacg 



acgata atg gag gag 
Met Glu Glu 
1 



115 



eta gat cgt tea cgt get ttt gee aga gac gte aaa cgt ate gte gtt 163 

Leu Asp Arg Ser Arg Ala Phe Ala Arg Asp Val Lys Arg He Val Val 

5 10 15 

2 aag gtt ggg aca gca gtt gtt act gga aaa ggt gga aga ttg get ett 211 

Lys Val Gly Thr Ala Val Val Thr Gly Lys Gly Gly Arg Leu Ala Leu 

2} 20 25 30 35 

ggt cgt tta gga gca ctg tgt gaa cag ctt gcg gaa tta aac teg gat 259 

^ Gly Arg Leu Gly Ala Leu Cys Glu Gin Leu Ala Glu Leu Asn Ser Asp 

^.j 40 45 50 

p 

gga ttt gag gtg ata ttg gtg tea tet ggt gcg gtt ggt ett gge agg 307 

Gly Phe Glu Val He Leu Val Ser Ser Gly Ala Val Gly Leu Gly Arg 

55 60 65 

caa agg ctt cgt tat cga caa tta gte aat age age ttt gcg gat ctt 355 

Gin Arg Leu Arg Tyr Arg Gin Leu Val Asn Ser Ser Phe Ala Asp Leu 

70 75 80 

cag aag ect cag act gaa ctt gat ggg aag get tgt get ggt gtt gga 403 

Gin Lys Pro Gin Thr Glu Leu Asp Gly Lys Ala Cys Ala Gly Val Gly 

85 90 95 



3 0 



caa age agt ctt atg get tac tat gag act atg ttt gac cag ctt gat 451 
Gin Ser Ser Leu Met Ala Tyr Tyr Glu Thr Met Phe Asp Gin Leu Asp 
100 105 110 115 

gtg acg gca get caa ctt ctg gtg aat gac agt agt ttt aga gac aag 499 

Val Thr Ala Ala Gin Leu Leu Val Asn Asp Ser Ser Phe Arg Asp Lys 
120 125 130 

Q 

p gat ttc agg aag caa ctt aat gaa act gtc aag tct atg ctt gat ttg 547 

m Asp Phe Arg Lys Gin Leu Asn Glu Thr Val Lys Ser Met Leu Asp Leu 

m 135 140 145 

H 

H agg gtt att oca att ttc aat gag aat gat get att age acc cga aga 595 

PJ 

Rj Arg Val He Pro lie Phe Asn Glu Asn Asp Ala He Ser Thr Arg Arg 

^ 150 155 160 

gee cca tat cag gat tet tct ggt att ttc tgg gat aac gat age tta 643 
Ala Pro Tyr Gin Asp Ser Ser Gly He Phe Trp Asp Asn Asp Ser Leu 
165 170 175 

get get eta ctg gcg ttg gaa ctg aaa get gat ctt ctg att ctt ctg 691 
Ala Ala Leu Leu Ala Leu Glu Leu Lys Ala Asp Leu Leu He Leu Leu 
180 185 190 195 

age gat gtt gaa ggt ctt tac aca ggc cct cca agt gat cet aac tea 739 
Ser Asp Val Glu Gly Leu Tyr Thr Gly Pro Pro Ser Asp Pro Asn Ser 
200 205 210 



31 



ry 
ru 



aag ttg ate cac act ttt gtt aaa gaa aaa cat caa gat gag att aca 787 
Lys Leu He His Thr Phe Val Lys Glu Lys His Gin Asp Glu He Thr 
215 220 225 

ttc ggc gac aaa tea aga tta ggg aga ggg ggt atg act gca aaa gtc 835 
Phe Gly Asp Lys Ser Arg Leu Gly Arg Gly Gly Met Thr Ala Lys Val 
230 235 240 

aaa get gca gtc aat gca get tat get ggg att ect gtc ate ata acc 883 
Lys Ala Ala Val Asn Ala Ala Tyr Ala Gly He Pro Val He He Thr 
245 250 255 



931 



agt ggg tat tea get gag aac ata gat aaa gtc etc aga gga eta cgt 
Ser Gly Tyr Ser Ala Glu Asn He Asp Lys Val Leu Arg Gly Leu Arg 
260 265 270 275 



gtt gga acc ttg ttt cat caa gat get cgt tta tgg get ccg ate aca 979 
Val Gly Thr Leu Phe His Gin Asp Ala Arg Leu Trp Ala Pro He Thr 
280 285 290 

gat tct aat get cgt gac atg gca gtt get gcg agg gaa agt tec aga 1027 
Asp Ser Asn Ala Arg Asp Met Ala Val Ala Ala Arg Glu Ser Ser Arg 
295 300 305 

aag ctt cag gcc tta tct teg gaa gac agg aaa aaa att ctg ctt gat 1075 
Lys Leu Gin Ala Leu Ser Ser Glu Asp Arg Lys Lys He Leu Leu Asp 
310 315 320 



32 



att gcc gat gcc ctt gaa gca aat gtt act aca ate aaa get gag aat 
He Ala Asp Ala Leu Glu Ala Asn Val Thr Thr lie Lys Ala Glu Asn 
325 330 335 



1123 



gag tta gat gta get tot gca caa gag get ggg ttg gaa gag tea atg 1171 
Glu Leu Asp Val Ala Ser Ala Gin Glu Ala Gly Leu Glu Glu Ser Met 
340 345 350 355 

0 gtg get ego tta gtt atg aca cct gga aag ate teg age ctt gca get 1219 

01 Val Ala Arg Leu Val Met Thr Pro Gly Lys lie Ser Ser Leu Ala Ala 
^ 360 365 370 

si 

lA tea gtt cgt aag eta get gat atg gaa gat eca ate ggc egt gtt tta 1267 

tJj Ser Val Arg Lys Leu Ala Asp Met Glu Asp Pro He Gly Arg Val Leu 

"'i 375 380 385 

O 

aag aaa aca gag gtg gca gat ggt ctt gtc tta gag aag ace tea tea 1315 
Lys Lys Thr Glu Val Ala Asp Gly Leu Val Leu Glu Lys Thr Ser Ser 
390 395 400 

cca tta ggc gta ctt ctg att gtt ttt gaa tec cga cct gat gca ctt 1363 
Pro Leu Gly Val Leu Leu He Val Phe Glu Ser Arg Pro Asp Ala Leu 
405 410 415 

gta cag ata get tea ctt gcc ate cgt agt gga aat ggt ctt ctg ctg 1411 
Val Gin He Ala Ser Leu Ala He Arg Ser Gly Asn Gly Leu Leu Leu 
420 425 430 435 



33 



aag ggt gga aag gag gcc egg cga tea aat get ate tta cae aag gtg 
Lys Gly Gly Lys Glu Ala Arg Arg Ser Asn Ala He Leu His Lys Val 
440 445 450 



1459 



ate act gat gca att cca gag act gtt ggg ggt aaa etc att gga ctt 1507 

He Thr Asp Ala He Pro Glu Thr Val Gly Gly Lys Leu He Gly Leu 

455 460 465 

fi 

m gtg act tea aga gaa gag att cct gat ttg ctt aag ctt gat gac gtt 1555 

pjj Val Thr Ser Arg Glu Glu He Pro Asp Leu Leu Lys Leu Asp Asp Val 

k 470 475 480 

m 

H 

ate gat ctt gtg ate cca aga gga age aac aag ctt gtt act cag ata 1603 

He Asp Leu Val He Pro Arg Gly Ser Asn Lys Leu Val Thr Gin He 
485 490 495 

aaa aat act aca aaa ate cct gtg eta ggt cat get gat gga ate tgt 1651 
Lys Asn Thr Thr Lys He Pro Val Leu Gly His Ala Asp Gly He Cys 
500 505 510 515 



SI 



1699 



cat gta tat gtc gac aag get tgt gat acg gat atg gca aag cgc ata 
His Val Tyr Val Asp Lys Ala Cys Asp Thr Asp Met Ala Lys Arg He 
520 525 530 



gtt tct gat gca aag ttg gac tat cca gca gcc tgt aat gcg atg gaa 1747 
Val Ser Asp Ala Lys Leu Asp Tyr Pro Ala Ala Cys Asn Ala Met Glu 
535 540 545 



34 



acc ctt ctt gtg cat aag gat eta gag cag aat get gtg ctt aat gag 1795 
Thr Leu Leu Val His Lys Asp Leu Glu Gin Asn Ala Val Leu Asn Glu 
550 555 560 

ctt att ttt get ctg cag age aat gga gtc act ttg tat ggt gga cca 1843 
Leu He Phe Ala Leu Gin Ser Asn Gly Val Thr Leu Tyr Gly Gly Pro 
565 570 575 

O agg gca agt aag ata ctg aac ata cca gaa gca egg tea ttc aac cat 1891 



o 



Arg Ala Ser Lys He Leu Asn He Pro Glu Ala Arg Ser Phe Asn His 
580 585 590 595 

gag tac tgt gee aag get tgc act gtt gaa gtt gta gaa gac gtt tat 1939 
Glu Tyr Cys Ala Lys Ala Cys Thr Val Glu Val Val Glu Asp Val Tyr 
600 605 610 

ggt get ata gat cac att eac cga eat ggg agt gca cac aca gac tgc 1987 
Gly Ala He Asp His He His Arg His Gly Ser Ala His Thr Asp Cys 
615 620 625 

att gtg aca gag gat cac gaa gtt gca gag eta ttc ctt cge caa gtg 2035 
He Val Thr Glu Asp His Glu Val Ala Glu Leu Phe Leu Arg Gin Val 
630 635 640 

gat age get get gtg ttc cac aac gee age aca aga ttc tea gat ggt 2083 
Asp Ser Ala Ala Val Phe His Asn Ala Ser Thr Arg Phe Ser Asp Gly 
645 650 655 



3 5 



ttc cga ttt gga ctt ggt gca gag gtg ggg gta age acg ggc agg ate 2131 
Phe Arg Phe Gly Leu Gly Ala Glu Val Gly Val Ser Thr Gly Arg lie 
660 665 670 675 

cat get cgt ggt cca gtc ggg gtc gaa gga tta ctt aca acg aga tgg 2179 
His Ala Arg Gly Pro Val Gly Val Glu Gly Leu Leu Thr Thr Arg Trp 
680 685 690 

ata atg aga gga aaa gga caa gtt gtc gac gga gac aat gga att gtt 2227 
lie Met Arg Gly Lys Gly Gin Val Val Asp Gly Asp Asn Gly lie Val 
695 700 705 

tac acc cat cag gac att ccc ate caa get taaacaagac ttccgagtgt 2277 
Tyr Thr His Gin Asp He Pro He Gin Ala 
710 715 

gtgtttgtgt atttggttga gaettgagga gagacacaga ggaggatggg cttttttgtt 2337 

tcctctctgc ttagtactca tatcctatca ttattattat tactactact tattattgaa 2397 

aecctcgctt atgtagtggt tttgatttag ggttaggatt gcaccaaaaa taagatecac 2457 

tttaccactt agtcttgctc ataagtacga tgaagaacat ttaattagct tctcttcttg 2517 

tcattgtaag ctacctacac atttctgatc tttatcaaga tactactact tttc 2571 
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agcgtttaga aaaaaacagc gataaaaccg aaacatcaag caaacaaaaa aaaaagagaa 60 

gagaaattat ttttttttgt tttcgttttc aaaaacaaaa tctttgaatt tt atg gca 118 

Met Ala 
1 

ace cgt ctt etc cga aca aac ttt ate egg cga tct tac cgt tta ccc 166 
Thr Arg Leu Leu Arg Thr Asn Phe He Arg Arg Ser Tyr Arg Leu Pro 



3 7 



5 



10 



15 



get ttt age ccg gtg ggt ect ccc ace gtg act get tec ace gee gtc 214 
Ala Phe Ser Pro Val Gly Pro Pro Thr Val Thr Ala Ser Thr Ala Val 
20 25 30 

gtc ccg gag att etc tec ttt gga caa caa gca ccg gaa cea ect ett 262 
Val Pro Glu He Leu Ser Phe Gly Gin Gin Ala Pro Glu Pro Pro Leu 
35 40 45 50 

eac cae cea aaa cee ace gag caa tet eac gat ggt etc gat etc tee 310 
His His Pro Lys Pro Thr Glu Gin Ser His Asp Gly Leu Asp Leu Ser 
55 60 65 

gat caa gee egt ctt tte tec tet ate cea ace tet gat etc etc cgt 358 
'*j Asp Gin Ala Arg Leu Phe Ser Ser lie Pro Thr Ser Asp Leu Leu Arg 

°f 70 75 80 

tec acc gcc gtg ttg cat gcg gcg gcg ata ggt ect atg gtc gac eta 406 
Ser Thr Ala Val Leu His Ala Ala Ala He Gly Pro Met Val Asp Leu 
85 90 95 

ggg acg tgg gtc atg age tet aaa ctt atg gac get teg gtg aeg cgt 454 
Gly Thr Trp Val Met Ser Ser Lys Leu Met Asp Ala Ser Val Thr Arg 
100 105 110 

ggc atg gtt tta ggg ctt gtg aaa agt acg ttt tat gac cat ttt tge 502 
Gly Met Val Leu Gly Leu Val Lys Ser Thr Phe Tyr Asp His Phe Cys 



3 8 



115 



120 



125 



130 



O 

ru 

01 



gcc ggt gaa gat gcc gac gca gcc get gag cgc gtg aga age gtt tat 550 
Ala Gly Glu Asp Ala Asp Ala Ala Ala Glu Arg Val Arg Ser Val Tyr 
135 140 145 

gaa get act ggt ctt aaa ggg atg ctt gtc tat ggc gtc gaa cac gcc 598 
Glu Ala Thr Gly Leu Lys Gly Met Leu Val Tyr Gly Val Glu His Ala 
150 155 160 



gat gac get gta tet tgt gat gat aae atg caa caa ttc att cga acc 
Asp Asp Ala Val Ser Cys Asp Asp Asn Met Gin Gin Phe He Arg Thr 



646 



IP 

^1 165 170 175 



att gaa get gee aaa tet tta eea aea tet eac ttt age tea gtg gtt 694 
He Glu Ala Ala Lys Ser Leu Pro Thr Ser His Phe Ser Ser Val Val 
180 185 190 



742 



gtg aag ata act gcc att tgt cca att agt ctt ctg aaa cga gtg age 
Val Lys He Thr Ala He Cys Pro He Ser Leu Leu Lys Arg Val Ser 
195 200 205 210 



gat ctg ctg egg tgg gaa tac aaa agt ccg aac ttc aaa etc tea tgg 790 
Asp Leu Leu Arg Trp Glu Tyr Lys Ser Pro Asn Phe Lys Leu Ser Trp 
215 220 225 

aag etc aaa teg ttt ccg gtt ttc tec gaa teg agt cct etc tac cac 838 
Lys Leu Lys Ser Phe Pro Val Phe Ser Glu Ser Ser Pro Leu Tyr His 



39 



230 



235 



240 



aca aac tea gaa ccg gaa ccg tta acc gcg gaa gaa gaa agg gag etc 
Thr Asn Ser Glu Pro Glu Pro Leu Thr Ala Glu Glu Glu Arg Glu Leu 
245 250 255 

gaa gca get eat gga agg att caa gaa ate tgt agg aaa tgc caa gag 
Glu Ala Ala His Gly Arg He Gin Glu He Cys Arg Lys Cys Gin Glu 
260 265 270 

tee aat gta cca ttg ttg att gat gcg gaa gae aca ate etc caa ecc 
Ser Asn Val Pro Leu Leu He Asp Ala Glu Asp Thr He Leu Gin Pro 
275 280 285 290 

gcg ate gat tae atg get tat tea teg gcg ate atg ttc aat get gac 
Ala He Asp Tyr Met Ala Tyr Ser Ser Ala He Met Phe Asn Ala Asp 
295 300 305 

aaa gae cga cca ate gtt tae aac acg att cag gcg tac ttg aga gac 
Lys Asp Arg Pro He Val Tyr Asn Thr He Gin Ala Tyr Leu Arg Asp 
310 315 320 

gee ggt gag aga ctg eat ttg gca gta caa aat get gag aaa gag aat 
Ala Gly Glu Arg Leu His Leu Ala Val Gin Asn Ala Glu Lys Glu Asn 
325 330 335 

gtt cct atg ggg ttc aag ttg gtg aga ggg get tac atg tct age gaa 
Val Pro Met Gly Phe Lys Leu Val Arg Gly Ala Tyr Met Ser Ser Glu 



340 



345 



350 



O 



cgt age ttg gcg gat tec etg ggt tge aag teg eca gtc cac gac aca 1222 

Arg Ser Leu Ala Asp Ser Leu Gly Cys Lys Ser Pro Val His Asp Thr 

355 360 365 370 

att cag gat act cac tct tgt tac aat gat tgt atg aca ttc ctg atg 1270 

He Gin Asp Thr His Ser Cys Tyr Asn Asp Cys Met Thr Phe Leu Met 
375 380 385 



gag aaa gca tea aac ggt tct ggt ttc ggt gtc gtt etc gca aca cat 
1^1 Glu Lys Ala Ser Asn Gly Ser Gly Phe Gly Val Val Leu Ala Thr His 

390 395 400 



M 
fU 

pj aac get gat teg ggg aga ctt gcg teg agg aaa gcg agt gac etc ggg 1366 

Asn Ala Asp Ser Gly Arg Leu Ala Ser Arg Lys Ala Ser Asp Leu Gly 
P 405 410 415 



ate gat aaa cag aac ggg aag ata gag ttt gca cag eta tat ggt atg 1414 
He Asp Lys Gin Asn Gly Lys He Glu Phe Ala Gin Leu Tyr Gly Met 
420 425 430 

tea gat gca ttg tec ttc ggg tta aag aga gca ggg ttc aat gtt age 1462 
Ser Asp Ala Leu Ser Phe Gly Leu Lys Arg Ala Gly Phe Asn Val Ser 
435 440 445 450 

aag tac atg ecg ttt gga ecc gtc gca acc get ata ccg tat ctt etc 1510 
Lys Tyr Met Pro Phe Gly Pro Val Ala Thr Ala He Pro Tyr Leu Leu 



4 1 



455 



460 



465 



cga cgc get tat gag aac egg gga atg atg gcc acc gga get eat gac 1558 
Arg Arg Ala Tyr Glu Asn Arg Gly Met Met Ala Thr Gly Ala His Asp 
470 475 480 

cgt caa etc atg agg atg gaa ctt aag agg aga tta ate gee ggg att 1606 

Arg Gin Leu Met Arg Met Glu Leu Lys Arg Arg Leu He Ala Gly He 

g 485 490 495 

O 
r = 

bi gcg taaagagaga gtatggagec attaaatgaa attgggaaat gtagatgaat 1659 

1=^ aaatttcttc tatgtagttt aagaaattga aaaeaaaaaa ttataatata agaaatggag 1719 

taggtaagaa catttcctgt ggctaaatat ttttcatgag ggactatgtt tttactatca 1779 
atatatcatt cacaaatgta tattcacctt atcaataaaa atgcttttta cttt 1833 
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